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ABSTRACT 

We present a method to minimize, or even cancel out, the nuisance parameters affect- 
ing a measurement. Our approach is general and can be applied to any experiment 
or observation. We compare it with the bayesian technique used to deal with nui- 
sance parameters: marginalization, and show how the method compares and improves 
by avoiding biases. We illustrate the method with several examples taken from the 
astrophysics and cosmology world: baryonic acoustic oscillations, cosmic clocks, Su- 
pernova Type la luminosity distance, neutrino oscillations and dark matter detection. 
By applying the method we recover some known results but also find some interest- 
ing new ones. For baryonic acoustic oscillation (BAO) experiments we show how to 
combine radial and angular BAO measurements in order to completely eliminate the 
dependence on the sound horizon at radiation drag. In the case of exploiting SNla 
as standard candles we show how the uncertainty in the luminosity distance by a 
second parameter modeled as a metallicity dependence can be eliminated or greatly 
reduced. When using cosmic clocks to measure the expansion rate of the universe, we 
demonstrate how a particular combination of obscrvables nearly removes the metal- 
licity dependence of the galaxy on determining differential ages, thus removing the 
age-metallicity degeneracy in stellar populations. We hope that these findings will be 
useful in future surveys to obtain robust constraints on the dark energy equation of 
state. 

Key words: statistical methods; cosmology: theory 
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1 INTRODUCTION 

It is often the case that measurements in an experiment 
are hampered by systematic uncertainties or poorly known 
quantities that bias or increase the error on the experimen- 
tal quantity we wish to measure. The usual way to deal with 
these quantities which we will refer to as "nuisance param- 
eters", is to "marginalize" over them, i.e. in the bayesian 
framework, to integrate the likelilhood over the full range 
allowed by the parameter space of the nuisance parameters. 
This has the inconvenience that it makes the procedure ex- 
plicitily dependent on the choice of the prior, it is not guar- 
anteed to be unbiased and it is often non-optimal. A trivial 
case where marginalization will fail at providing the opti- 
mal solution is the following: imagine we obtain two mea- 
surements X and y that depend on two quantities of interest 
9i , 02 and on a nuisance parameter n in the following form 
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X — di/n and y = n^2 • It is obvious that the best way to re- 
move the nuisance parameter n is to form the ratio r = xy. 
This reduces the number of "data" but removes completely 
the dependence on the nuisance parameter. In real life it will 
not always be the case that one can completely eliminate the 
nuisance parameter because the functional form of the ob- 
scrvables might not allow one to do so, or because there are 
less observables than nuisance parameters. It would be of 
value to have a general prescription that describes whether 
the observables can be combined in such a way as to com- 
pletely eliminate one or more nuisance parameters or, if ex- 
act cancellation is not possible, which combination could 
minimize their impact on the final measurement. 

Today, mitigation of the effect of systematic uncertain- 
ties is a crucial issue. This is especially true in cosmology, 
where, in the era of precision cosmology brought about by 
the avalanche of data of the last decade, statistical errors 
keep shrinking, and the ultimate error-floor is often im- 
posed by systematics or nuisance parameters. In the liter- 
ature, however, there is no generic prescription to address 
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this issue besides marginalization. Still, results obtained by 
marginalization are often not independent of the system- 
atic effects as we will show in concrete cases. The state of 
the art in mitigating the effect of systematic uncertainties is 
represented by some specific examples worked out and ap- 
plied only to specific cases. For example |Song fc Percival] 
(20091 noted that two quantities related to the growth of 



cosmological structures that can be measured from galaxy 
redshift surveys are /3 = f/b and a8,gai ~ bag.m- Here 
/ = d In S/d In a is the logarithmic growth rate of structures 
and depends on key cosmological parameters that we want 
to measure, b is the galaxy bias and our nuisance parameter 
as it is poorly known, as^gai.m denotes the rms fluctuations 
of a sphere of 8Alpc/h for the galaxies and dark matter re- 
spectively and cosmological information through the growth 
factor is enclosed in crs.m- Similarly to our example above. 



Song & Percival ( 2009 1 suggested to use the combination 



/3a"g,g to remove the bias. In reality, when looking in de- 
tail at the equations governing redshift space distortions in 
galaxy redshift surveys, one finds that the full redshift-space 
power spectrum depends on different combinations of /, b 
and as^rn namely /erg,™, fba^,^, bas,m', thus the data them- 
selves may allow one to separate the various parameters. 
The use of ratios of observables to cancel systematics has 
been widespread. The list of examples also includes observ- 
ables with unequal energy ranges so that the ratio of ob- 
servables cancels the dependences on systematic errors in 
neutrino detectors (Villante, Fiorentini, fc Lisi||1999l or on 



the theoretical distribution of dark matter particles ( Fox et 
al. 2011a|bl; there are also specific methods that apply in 



some particular models of dark energy ( March et. al.|20TT |. 

Another well-known example is the standard procedure 
to minimize the effect of noise bias in Cosmic Microwave 
Background angular power spectra. The angular power spec- 
trum computed from the auto-correlation of a map produced 
by a given detector is given by 0^""" = Cr"'=-hC"°"'= where 
Qnoise jjgjjQ^gg ^Yie power spectrum of the detector noise and 
is called "noise bias" : a poor knowledge of the noise bias will 
bias the estimate of the CMB power spectrum. On the other 
hand the cross power spectrum for maps produced by dif- 
ferent, uncorrelated, detectors does not have noise bias. 

In any other context a new prescription must be worked 
out from scratch using detailed knowledge and intuition of 
the specific problem. 

The aim of this paper is to provide a general algo- 
rithm that minimizes, in an unbiased way, the impact of 
systematic uncertainties if they can be somewhat parame- 
terized and poorly known quantities on experimental mea- 
surements. The method is completely general and can be 
applied to any experiment where the nuisance parameters 
can be characterized (at least approximately). After illus- 
trating the approach and deriving the general prescription, 
we present a suite of applications, of different levels of com- 
plexity. Though our approach is general we will consider for 
definiteness cases for which the relevant quantities can be 
modeled as power-laws or linear functions. In order to illus- 
trate the method, we concentrate in some problems in as- 
tronomy and cosmology and show how the method reduces 
the impact of systematics. In particular we address the fol- 
lowing problems: 1) measurements of the expansion history 
of the universe using baryonic acoustic oscillations (BAO). 
Here we show how different measurement of the BAO scale 



at different redshifts can be optimally combined to cancel 
out uncertainties in the sound horizon scale. 2) Solar neu- 
trinos and the solar metallicity problem. We discuss how 
to better minimize the impact of uncertainties due to the 
solar abundances and due to theoretical inputs like opac- 
ities and diffusion. 3) Cosmic clocks and how to minimize 
the influence of systematic uncertainties in the metallicity of 
galaxies while estimating their age. 4) SN Type lA surveys 
to measure the expansion history of the universe and how to 
minimize the dependence of the Hubble diagram on a second 
parameter. We also recover some well-known results such as 
the cancellation of the noise bias in the measurement of the 
CMB angular power spectra and the cancellation of astro- 
physical inputs in dark matter observabations through the 
use of observables with adjusted energy ranges. 



2 A SYSTEMATIC APPROACH. ANALYSIS 
AND RECIPE 

We consider A'^ observables Oi,i = 1, ...,N that depend on m 
accurate or interesting quantities ^i, i = 1, ...,m and n "un- 
known" or "biased" nuisance quantities Vi, i = l,...,n, i.e. 
Oi(/ii, . . . , ^m, vi, . . . , Un). Our working assumption is that 
we are ignorant of the mean values of the nuisance parame- 
ters Uj and their errors {vjVi) and {/ijUi). Our goal is to find 
combinations of the observable quantities fk{Oi, ...,On), 
k — f,...,Af such that they are not affected by our igno- 
rance of the nuisance parameters. This requirement implies 
that a) the variance of fk is independent of the variance and 
covariance of the Ui, guaranteed by the lack of explicit de- 
pendence of fk on Ui and b) the mean value of ft should 
be independent of the mean value of the Vi, guaranteed by 
the null total derivative of fk with respect to the Ui. These 
conditions can be written explicitly as a set of first order 
partial differential equations 



fk 

dfk 
dvi 



fkiOi 

N 



E 



On); 

dfk do, ^ 
dOi dui 



for i = 1, . 



(1) 

(2) 



Note that even if the nuisance parameters have a distribu- 
tion which is very different from a Gaussian, the condition 
that fk have no explicit dependence on Ui guarantees that 
the correlation functions involving an arbitrary number of 
solutions fk, I.e. {fi^ ...f^") will be independent of any cor- 
relation functions involving the nuisance parameters Vi. If 
the distribution of the nuisance parameters is assumed to be 
Gaussian, it is enough to require that the partial derivative 
of fk with respect to Ui vanishes, dfk/dvi = for i = 1, ..., n, 
in order to guarantee that the correlation functions of fk are 
independent of correlations involving the nuisance param- 
eters. Furthermore, in the Gaussian case the correlations 
between the nuisance parameters and the solutions will van- 
ish (vifk) = due to the vanishing total derivative, but in 
the general non-Gaussian case they are different from zero 
{i^^.fk) + 0. 

A natural interpretation of the recipe defined by eq. ([2|) 



is the renormalization group equation (Wilson fc Kogut 



19741. In this framework, the nuisance unknown quantities 



Ui are interpreted as setting the renormalization group scale. 
Physical quantities fk{i^i,Oj{u)) are then defined as those 
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functionals of the observables that are invariant under arbi- 
trary changes of the values of the nuisance i.e as the ones 
satisfying the generalized renormalization group equation 



(ilr + ft.. 



d 
do. 



)/fc(i/,,o,H) = o 



(3) 



with Pi J — -g^ the corresponding /3 functions. In other 
words, by means of this recipe we identify the physics that 
is invariant under arbitrary rescalings of the nuisance quan- 
tities. This nuisance-independent physics is completely char- 
acterized by the nuisance scaling dimensions /3ij of the ob- 
servables. In the limit case where nuisance can be totally 
washed out this recipe will unravel the underlaying respon- 
sible scale mvariance. 

For definitiness, let us show the solutions for the case 
of N observable quantities Oi which are modeled by power 
laws of the n nuisance parameters Vj 



O, - O, 



5(M)n('^- 



(4) 



where g is some function of /2, a vector containing all other 
quantities on which the observables depend, Uij are known 
exponents and Oj is the true value of Uj, which does not 
necessarily need to be known. The solution to the system 
of the 2n first-order partial differential equations is a power 
law of the observables: 



fk = l[{o, - O.)"- 



(5) 



which leads for each fc to a system oi j = l,...,n linear 
algebraic equations for the z = 1, ..., A*' unknowns b^ . Clearly 
there are k = 1,...,M — N — n non-trivial independent 
solutions, 



E' 



,&f =0. 



(6) 



After removing the nuisance parameters, we are left with 
N — n observables. This means that we loose one observable 
per each nuisance parameter we eliminate and therefore that 
we can only eliminate A'"— 1 nuisance parameters. Compared 
to the case in which a prior is imposed on the nuisance 
parameters and one marginalizes over them, in this approach 
the resulting statistical errors on the fij are expected to 
increase. However we now obtain a set of observables which 
is independent of the systematics (and thus we do not rely 
on any choice of prior for the i/i). This is generally true 
and does not depend on our assumption that the relevant 
quantities can be modeled as power-laws. Note also that any 
combination of solutions to the differential equations, Eq. [2J 
is also a solution, we simply need to find all independent 
solutions. 

Exactly the same analysis can be repeated for the case 
in which the observables can be modeled by linear combina- 
tions of the nuisance parameters 



Oi -di= g{\i) + ^ aij(i 



(7) 



such that the solution to the system of first order differential 
equations is given by a linear combination of the observables 



/fc = ^&f(o.-6o 



(8) 



and we are once more led to a system of n linear algebraic 



equation for the A*' unknowns hi 
solutions. 



E' 



,fef = 0. 



with N — n non-trivial 



(9) 



In Sec. [3] we discuss applications of Eq. [6] valid for the 
linear and the power law cases and present the non triv- 
ial solutions in cases where we have more observables than 
nuisance parameters. 

Now we concentrate on problems with fewer observables 
than nuisance parameters. In this case there is no exact solu- 
tion for the system of Eq. [2] However, in some specific cases 
that are still of practical interest there may be approximate 
solutions. In particular, we aim at problems where observ- 
ables have similar (but not identical) dependences on some 
of the nuisance parameters (i.e., Uij ~ an for some i,j,l). 
The main difference from the exact treatment above will be 
that we cannot impose the full condition EqM but we should 
rather minimize the impact of systematics on the new ob- 
servables. If we shift a given nuisance parameter Ui by Aui 
the change in fk is approximately given by dfk/dvi Avi. The 
optimal fk should minimize the square of this quantity with 
respect to the parameters. We avoid the trivial solution by 
using Lagrange multipliers, which in the particular cases of 
observables modeled by Eq. Hlor Eq.JTl leads to the function 



Ck 



E 



d/fc 
Avi 



Ai/f - A, 



E(^^ 



A\ 



(10) 



where \k is a Lagrange multiplier to be solved for, Ak is the 
norm of the vector 6* , and Afj is the uncertainty on the 
j nuisance parameter. Let us start by studying the power- 
law case. In order to minimize this function, we vary it with 
respect to 6^ using the expression for fk as a power law of 
observables given in Eq. [5] which leads to the eigenvalue 
equation 



E^«^^-E 



A^E 



Ai^, 



aij aij 



fef = Afcfo*' . 



(11) 



where the first equality defines the matrix M. This is our 
central equation. The Lagrange multiplier A^, which will be 
the eigenvalues of the matrix M'li, measures how much the 
extremal (with respect to the nuisance parameters) solution 
fk is affected by th e nu isance parameters. This can be seen 
by multiplying Eq. 



11 



by hi , summing over i, and noticing 
that the left-hand side of the resulting expression is pre- 
cisely Y.Mh/Ayjf^ul thus {^fkf = \kT.Mf- The 
eigenvectors correspond to independent combinations of ob- 
servables, and, if we are interested in minimizing the impact 
of nuisance parameters, we should choose those eigenvectors 
corresponding to eigenvalues which are small with respect 
to typical entries of the matrix. In fact, the existence of a 
small eigenvalue is due to the similar dependence of the ob- 
servables on a particular nuisance parameter. 

We can easily verify that this description is more gen- 
eral and includes the previously discussed cases. If an eigen- 
value is zero, there is an independent non-trivial solution 
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which is unaffected by changes of the nuisance parameters. 
Indeed it can be easily seen from Eq]ll|that there will always 
he N — n null eigenvalues when n < N, and their eigen- 
vectors correspond to the solutions of Eq. [6] Other eigen- 
values are expected to be of the order of typical entries of 
the matrix A ~ f^a?, representing a solution for which the 
change under variations of the nuisance parameters is large 
A/^ ~ f^a^ ^^ &i . In the case where n > N, even if all the 
eigenvalues are different from zero, small eigenvalues of M^i 
correspond to solutions which change only slightly under 
shifts of the nuisance parameters and one can take these to 
be the approximately unbiased combinations. There will be 
as many independent approximately unbiased quantities as 
there are small eigenvalues of the matrix. These correspond 
to the instances where the nuisance parameters appear in 
Eqll] with powers which are almost the same for each Oi 
(aij ~ au). 

We can repeat the analysis for the linear case. Minimiz- 
ing the function defined in Eq. [lO] and using the expression 
for fk as a linear combination of observables we now obtain 
the following eigenvalue equation 



j:Mubf=j: 



^{Aiyjfaijaij 



j=i 



bf = Afcfe*' , 



(12) 



where now the matrix M has a different shape, but the same 
considerations as above apply, i.e. we look for small eigen- 
values and their corresponding eigenvectors will be solutions 
that are mildly influenced by the nuisance parameters. 

Let us now remark that, in some problems, the nuisance 
parameters or the observables are constrained to satisfy £ 
relations, say gi{vi, ...,i/n,Oi, ...,On) = 0, for i — !,...,£. 
This implies that they are not independent. The way to treat 
these cases is again through the use of Lagrange multipliers, 
such that when the nuisance parameters can be cancelled 
out exactly, the total derivative equation must be replaced 
by the following one 



/'=(Oi,...,Ojv)+^Q.5,(t/i,...,!/„,Oi,...,Oiv) =0,(13) 



di^i 



where ai are the Lagrange multipliers. We will see an ex- 
plicit example of this case in §( 3.6 1. The way to do it in the 
formalism of Eq. [lO] is that instead of minimizing the func- 
tion C from Eq. |10[ one has to include the constraints in the 
following way 



r. = E 



j=i 



dui 



Aui - A, 



E(^" 



At 



+ }aigi{iyi,...,u„,Oi,...,ON) 



(14) 



where at are the Lagrange multipliers used to impose the 
conditions Qi — 0. One then minimizes Ck with respect to 
hi, Afe, and m. 

Before ending this section, let us make an important 
comment on Eq. |10| If one imposes an infinite uncertainty 
on a nuisance parameter, say Avi — >■ cxa for example, typ- 
ical eigenvalues of the matrix M will be infinite. However, 
if there are combinations of observables that are completely 
independent of i/i, some eigenvalues will be finite since for 
those combinations dfk/dvi = 0, so that one should re- 



strict to the space spanned by eigenvectors corresponding 
to those finite eigenvalues. In this sense looking for combi- 
nations of observables which are completely independent of 
some nuisance parameter vi is equivalent to assigning an 
infinite uncertainty to vi. 



3 APPLICATIONS 

In this section we illustrate several cases where our method 
is applicable. We start with the study of baryon acoustic 
oscillations measurements where we assume that the sound 
horizon at radiation drag, which is common to all observa- 
tions, is the only quantity affected by systematic effects. In 
this case the dependence on the systematic can be cancelled 
out exactly. A similar problem is that of canceling astrophys- 
ical inputs in dark matter observations through the use of 
observables with adjusted energy ranges so that a nuisance 

( |2011a|b |. 



function can be cancelled, as done in Fox et al 



Next we consider a case where cancellation of the nuisance 
parameters cannot be exact: we will then find approximate 
unbiased combinations. This is the case for solar neutrino 
fiuxes, where the neutrino fiux predictions in detail depend 
on many nuisance parameters but the dependences are sim- 
ilar enough that approximate unbiased combinations can be 
found. 

In the spirit of increasing complexity, in the next exam- 
ple we consider a case where there is one systematic common 
to all observations and on top of that a correction to this 
main trend that differs from observation to observation in a 
well known way. The method will cancel out the dependence 
on the common systematic and minimaze that of the indi- 
vidual corrections. This is the case of the cosmic clocks. A 
similar case is that of SNe lA where the intrinsic magnitude 
of the SNe is unknown: its average value will be constant 
and in common to all SNe but its actual value depends on 
an hidden variable (probably metallicity of the host galaxy). 

To conclude we consider a case that at first sight may go 
beyond the scope of our method, where the interesting pa- 
rameters and the nuisance parameters are defined through 
correlation of the observables and not the observables them- 
selves. This is the example of the instrumental noise in CMB 
angular power spectrum, where the observables are the tem- 
perature fluctuations (or their spherical harmonic transform 
aim) but it is the angular power spectrum Ce that carries 
direct information on the cosmological parameters and is 
affected by noise bias. 



3.1 Measurement of the expansion history of the 
Universe with Baryon Acoustic Oscillations 

Baryonic acoustic oscilation experiments (BAO) will, even- 
tually, measure the acoustic scale in the radial and per- 
perdicular direction in the sky. Let us suppose that one has 
measurements of the BAOs at A^^ different redshifts Zi . The 
observables produced by the astronomical surveys are 



DAiz,) = DA{z^)/rs=Se/{l + z,) 
H(zi) = H{z,)rs=cAz, 



(15) 
(16) 



Here Da{z) denotes the angular diameter distance to red- 
shift z, H(z) the Hubble parameter at redshift z, 59 denotes 
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the angular measurement of the BAO scale and Az the ra- 
dial measurement; Va denotes the sound horizon at radiation 
drag, which is the BAO "standard ruler". The sound hori- 
zon at radiation drag is determined by CMB observations, 
for standard cosmologies is affected by a very small error 
but its estimate can be significantly bieised for non-standard 



models (e.g., Eisenstein & White (20041; de Bernardis et al 



( |2009[ ); |Mangim, Verde, fc Behran| (|2010l)). This considera- 
tion motivates us to consider Vs a nuisance parameter. From 
eq. H] we see that we can write fk as a power law of the 
observables 



fk 






(Zi 



n ^(^^ 



(17) 



3 = N^+l 



SO that the values of the exponents b^ will be given by solu- 
tions to the system of linear equations Eq.[6](or equivalently 
finding the kernel space of the matrix in Eq. |11[ ) , which we 
write explicitly for the particular case of two redshifts zi, Z2, 



(6^ + 6^-6^-6^)^=0, 
rs 



with solutions 

/i = DA{zr)H{zr), 

/2 = H{Z2)/H{Z,), 

h = Da[z2)/Da[z^) 



(18) 



(19) 
(20) 
(21) 



Any function of these quantities will also be a solution of the 
differential equation but will not contain new information. 
For example, suppose that one adds fi — Da{z2)H{z2) to 
the set of solutions; this will simply be a combination of the 
solutions we listed, indeed fi — f 1/2/3- For the case of two 
redshifts, we obtain only 4—1 independent quantities after 
eliminating our single nuisance parameter r^ from our four 
observables. 

For measurements of BAOs at N^ different redshifts, 
the 2Nz — 1 solutions will be given by all the ratios g — 
DA{zi)/DA{zi) and h — H{zi)/H{zi) plus the combination 
/ = Da{zi)H{zi). Note that although it may seem that we 
obtained that zi is the pivot scale, this is purely arbitrary 
as one may point out that the same quantities with any 
Zi as the pivot scale are also solutions of the differential 
equation. This is simply a consequence of the fact that any 
combination of our solutions is also a solution. Any pivot 
is equally good since the errors on the parameters will be 
independent of the choice of pivot under the assumption 
that they are Gaussian. 



(1 + zf{l - fix) +nx{l + 2)3(l+-0 + ™a)g-3»„z/(l+.) 



DAiz) 



(1 + ^) 



dz' 



H{z') 



1/2 



(23) 



Note that although i}x appears explicitly in these expres- 
sions, flatness has been assumeqM BAOs surveys measure 
combinations proportional to DA{z)/rs and H{z)rs. Us- 
ing the Fisher matrix formalism, we can forecast errors 



for these combinations (Seo & Eisenstein 20071 combin- 
ing measurements at 5 redshifts bins of width Az = 0.1 
for z < 0.5 for a survey with specifics similar to those 
of BOSS-SDSSIII Qind 15 redshifts bins at z > 0.5 for a 
survey with specifics similar to those of EUCLID rj We 
will compare three cases: keeping the full set of observa- 
tions DA{zi)/rs, H{zi)rs ("full"); keeping only the ratios to 
some pivot scale D A{zi) / D a{zp) , H{zi)/H{zp) as is some- 
times advocated ("ratios"); and our "unbiased" combina- 
tions DAizi)/DA{zp), H(zi)/H(zp), Da{zp)H{zp). For the 
sake of simplicity, we keep the cosmology fixed and take 
into account only errors on Ho and r^ . Using equations |23| 
and 23 one can use the Fisher matrix for DA{zi)/rs and 
H{zi)rs in order to compute the Fisher matrix (and errors) 
for wo,'Wa,rs,Ho. The results are given in table IT] In the 
"full" case we must assume a prior error on r^ (for example 
a CMB prior) which we take to have a 30% error since for 
some non-standard models the (systematic) error on rs can 
be estimated to be as large as that, see e.g., |Mangilli, Verde, | 
& Beltran|(|2010|). 



From the table we see that the statistical errors are al- 
ways smaller when using the "full" set as was to be expected 
(our method is throwing away some information in order to 
obtain quantities which are insensitive to systematic errors, 
this implies that the statistical errors must grow). Let us 
remark that when we consider measurements at 20 different 
redshifts as specified above, we obtain that the errors on wo 
and Wa change only by 0.4% and 1% respectively between 
the "full" and "unbiased" sets. Thus, one is insensitive to 
systematic errors on rs at the price of a modest increase 
in errors. The precise estimate of how much information is 
lost depends on how big the assumed uncertainty on Vs is 
(or how big the systematic errors are), as can be seen on 
the table for the case in which we assume a 2% error on 
Vs. Finally, note that when one has measurements at fewer 
redshifts the difference between the "ratios" and "unbiased" 
cases increases dramatically, this is due to the fact that one 
is throwing away one observable from a set of just a handful 
of them, thus loosing comparatively more information. 



3.1.1 Implications /or cosmological parameter estimation 

In this subsection we we use the above findings to forecast 
errors for the dark energy equation of state parameter wx {z) 
using the widely used parametrization wx = wo + Wai^ — a). 
If the reader is not interested in the details of this anal- 
ysis, she or he can skip to section |3.2| In terms of this 
parametrization, the expansion rate and luminosity distance 
can be written as 



H{z) 



l-fix 



(22) 



3.2 Dark Matter 

We discuss in this section another example of exact can- 
cellation of nuisance parameters, or in general of nuisance 
functions. In particular, we consider energy dependent ob- 
servables, which have been used to cancel the dependences 



^ Although Q,x + ^m = li the present Hubble rate, parametrized 
by h, is still a free parameter. We follow [Seo fc Eisenstein| | |2007[ l 
in using Hx and Vlmh^ to parametrize the cosmology dependence. 
^ http://www.sdss3.org/surveys/boss.php 
3 http://arxiv.org/abs/0912.0914 
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set 



Crjrs 



redshifts 



F^j'^{wo,Wa,Ox,^mh'^,rs) 



full 



0.3 



20 



/ 0.0579 



V - 



-0.215 
0.982 



-0.00801 
0.0371 
0.00146 



4.45 X 10"^ 

-2.87 X 10-s 

-1.14 X 10-^ 

1.5 X ID- 



'S 



0.981 

-6.34 

-0.252 

-0.000893 

57.6 



unbiased 



0.3 



20 



f 0.0584 -0.218 
1. 



v - 



-0.00813 
0.0379 
0.00149 



0. 

0. 

0. 

1.5 X lO-** 



0. 
0. 
0. 
0. 
2025. 



0.0587 



ratios 



0.3 



20 



-0.216 
1.01 



-0.00821 
0.0375 
0.00151 



0. 

0. 

0. 

1.5 X 10- 



V 



0. 
0. 
0. 
0. 
2025. 



/ 0.0435 



full 



0.02 



20 



-0.121 
0.38 



-0.0043 

0.0132 

0.000505 



0.0000136 
-0.0000877 
-3.49 X 10 
1.49 X 10- 



6 



6 



0.133 

-0.859 

-0.0341 

-0.000121 

7.8142 



0.198 



unbiased 



0.3 



10 



-0.472 
1.63 



-0.00305 

0.036 
0.00201 



0. 

0. 

0. 

1.5 X 10- 



V - 



0. 
0. 
0. 
0. 
2025. 



ratios 



0.3 



10 



f 0.21 -0.454 -0.00303 
1.66 0.0361 

0.00201 

V - - 



0. 

0. 

0. 

1.5 X 10- 



0. 
0. 
0. 
0. 
2025. 



/ 0.171 



full 



0.3 



20 



-0.738 
3.41 



-0.0285 

0.132 
0.00519 



2.05 X 10"^ 

-0.0000103 

-4.07 X 10"^ 

1.5 X 10"^ 



4.53 

-22.8 

-0.897 

-0.000842 

169 



unbiased 



0.3 



20 



/ 0.182 



V - 



-0.793 
3.69 



-0.0307 

0.143 
0.00563 



0. 

0. 

0. 

1.5 X lO""^ 



0. 
0. 
0. 
0. 
2025. 



Table 1. Fisher matrix analysis for the BAOs. We repeat the analysis for the cases in which the Fisher is computed from each the "full" 
set of observables {DA{zi)/rs, H{zi)rs}, the "ratios" {D A{zi) / D A{zp) , H {zi) / H [zp)} including external priors on Ts, and our optimally 
"unbiased" set {DA{zi)/DA{zp), H(zi)/H{zp), DA{zp)H(zp)}, as specified in the first column; we have not specified the pivot redshift 
Zp since any choice gives the same result. We have assumed a typical variance (Jr^/rs of 0.02 or 0.3, as specified in the second column. 
We show the case in which one has measurements at 20 redshifts up to redshift z « 2, and also the case in which one has measurements 
at 10 redshifts up to redshift z ^ 0.8, as stated in the third column. In the fourth column we give the full Fisher matrix with the order 
of the indexes given by 1 = wq ,2 = iOa,3 = Ox, 4 = flmh'^,5 = Ts, thus for example the element in the first row and second column of a 
matrix gives an estimate for the correlation between wq and Wa . The 2 X 2 submatrix for wq ,Wa in the unbiased and ratios cases never 
depends on the assumed variance for rs or Qmh^ by construction. For most of the cases we assumed a prior on Qx oi ctq^/Qx = 0.01, 
except for the las two rows. 



on systematic errors in neutrino detectors ( Villante, Fioren- 
|tini, fc Lisi||1999[ ) or on the theoretical distribution of dark 
matter particles ( Fox et al.|2011a|b l. 

Let us consider observables Oi{E) that depend on the 
integral of some function f{x), that we may not know, within 
some known range [xi,yi], as 



The function f{x) in the neutrino case is called the re- 
sponse function and contains the convolution of the neutrino 



cross section, detector resolution and neutrino flux ( Villante, 



Fiorentini, & Lisi 19991. The function f{x) in the case of 
dark matter detection corresponds to f{v)/\v\, the ratio of 
the dark matter velocity distribution to the dark matter 
speed when observing the recoil energy spectrum, and to 



Oi{Ermn < E < Emax) = g{^.i)F{Xi{Emi-n.) ,yi(Emax)) ■ (24) 
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the integral of that function times the dark matter speed 
when observing total rates ( Fox et aL]|2011a|b I . 

For simplicity, we concentrate on the case of two ob- 
servables Oi. and O2. Let's assume that the observables 
have a energy range overlap such that we can find ener- 
gies Ea,Eb,Ec and Ed such that x\{Ea) = X2{Ec) = x and 
yi{Ei) = y2{Ea) =y. 

Our observables can be rewritten as, 



Oi{Ea <E <Et) = g{tii)F{x,y) 
OiiEc <E <Ei)= g{ii2)F{x,y). 



(25) 
(26) 



Following the lines discussed in the previous section, the 
dependence on the nuisance function f{x, y) is removed by 
the combination 0\{Ea < E < Eb)/02{Ec < E < Ed), as 
advocated in the above references. 



3.3 Solar Neutrinos 

The Standard Solar Model (SSM) depends on ~ 20 input pa- 
rameters, including the solar age and luminosity, the opac- 
ity, the rate of diffusion, the zero-age abundances of key 
elements (He, C, N, O, Ne, Mg, Si, S, Ar, Fe), and the nu- 
clear cross sections (S-factors) for the pp chain and CN cycle 
reactions. The observable quantities in this case are the so- 
lar neutrino fluxes'] 0^, in particular the ^Be or *B neutrino 
fluxes, which have been independently measured by the so- 
lar neutrino detectors. The observed neutrino fluxes depend 
on elements abundances, solar structure parameters and on 
the S-factors, but the quantities of interest are actually the 
S-factors and the rest are poorly determined -nuisance- pa- 
rameters. 

To be more specific, the multi-dimensional set of vari- 
ations in SSM input parameters {Aj3j} from the SSM best 
values {/3j^^} often collapses to a one-dimensional depen- 
dence of the neutrino fluxes on the solar core temperature 
(Tc), where Tc is an implicit function of the variations {A/3j} 
I jBahcall fc Ulmer||1996[ ). The correlations between cj>i and 
Tc are strong but not exact. 

The sensitivity to parameter variations can be expressed 
in terms of the logarithmic partial derivatives a{i,j) eval- 
uated for each neutrino flux (pi and and each SSM input 
parameter /3j, 



a{i,j) 



where 



dln[(Pi/(P: 



SSMl 



d\n[l3j/pfMl 



(27) 



' and Pj denote the SSM best values. This in- 
formation, in combination with the assigned uncertainties in 
the (3j , then provides an estimate of the uncertainty in the 
SSM prediction of 4>i- Here we employ the logarithmic par- 



tial derivatives of Pena-Garay & Serenelli (20081; Serenelli, 



Haxton, & Pena-Garay (2011 1, corresponding to the higher 
metal composition (Grevesse & Sauval 1998) SSM, 



/('Be) 



('Be) 



^Be)S 



(28) 



rr3. 558^1. 267 ^0.786 



'A" 



D" 



[0-1.07 0-0.441 00.878 nO.O qO.O q-O.OOI] 
1^11 ^33 ^34 Sl7 Oe7 ^^114 I 



* The index i here labels the nuclear reaction that produces the 
neutrinos. 



r 0.004 0.002 0.053 0.044 0.057 0.116 0.083 0.014 0.217] 
IXq X]^ Xq Xnc a^Mg 2;si Xg XAj Xp^ I 



/('B) = 



("Sg^SSM 



(29) 



rr7. 130^2. 702 4I. 380 r^0.280l 



0^0 



\s^,^-'' S 



'33 

xn 



0.427 qO.846 
>334 



ol.U 0-1.0 o0.0051 
O17 Sg7 Oil 

r 0.025 0.007 0.111 0.083 0.106 0.211 0.151 0.027 0.510] 



^O 2;p,ic 



114 
1 



^Ar ^Fc 



where each parameter on the left-hand side represents a 
Pj/Pj ■ The luminosity, opacity, solar age, and the dif- 
fusion parameters are denoted by Lq, O, A, and D, while 
S and X denote S-factor and abundance ratios. The errors 
assigned to the solar model inputs are (0.4, 2.5, 0.44, 15.0, 
0.9, 4.3, 5.1, 7.5 2.0, 7.2, 29.7, 32.0, 38.7, 53.9, 11.5, 1 1.5, 
9.2, 49.6, 11.5)%( [Serenelh, Haxton, fc Pena-Garay|2011| ) for 
all the quantities Lq, O, A, D, Sj Xq in the order as they 
appear in Eqs.[29]-[3U] 

A new re-evaluation of the solar composition (|As-| 



plund, Grevesse, & Sauval 2005 1, which leads to significantly 
smaller abundances than previously estimated, has led to 
the lack of matching of helioseismological data and SSM 
predictions. It is therefore relevant to test observables that 
are weakly dependent both on solar composition and more 
theoretical inputs like diffusion and opacity. In our analysis, 
we consider Lq, O, A, and _D_and x as nuisance parame- 
ters. The matrix M^ of Eq. 



11 



for the two combination of 



observables /(^Be) and /(*B) is 



35.69 
74.42 



74.42 
156.34 



whose smaller eigenvalue (0.22) has the corresponding eigen- 
vector is proportional to (2.098, 1). Therefore the combina- 
tion of observables 



/(»B) 



_ roO.485 Q-0.498 qO.996 q-1.0 
— Oil ^533 ^34 Si7 



qI.O q-0.007] 
Oe7 ^114 



(30) 



minimizes the impact of the errors due to composition, opac- 
ity and diffusion inputs (0.47% error) and therefore opti- 
mizes the determination of nuclear cross sections at solar 
core temperatures. 

The formalism used here is very quick in testing the ro- 
bustness of the method to changes in the assumed errors on 
the nuisance parameters. We have checked that the optimal 
observable does not significantly change by modifying the 
errors on some of the nuisance parameters. For example, if 
we double the errors on the abundance ratios, the power in 
fCSe) changes from 2.098 to 2.099 and the error from 0.47% 
to 0.92%, while if we double the errors in theoretical inputs 
like opacity and diffusion, the power in /(^Be) changes from 
2.098 to 2.105 and the error from 0.47% to 0.48%. 



3.4 The expansion history of the universe from 
Cosmic Clocks 

A direct method to determine the expansion history of the 
universe is to use cosmic clocks ( [Jimenez fc Loeb_2002J . The 
determination of the Hubble parameter is done by estimat- 
ing the differential age of "clocks", namely passively evolv- 
ing galaxies. In order to estimate the differential age one 
approach is to use the spectral feature around the 4000A 
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break (Moresco et al.||2011i, although it is desirable to ex- 



d4000 



o-d 



ploit the whole galaxy spectrum ( Jimenez et al. 2003 Simon, 



Verde, & Jimenez 2005 Stern et al. 20101. This feature. 



called D4000, depends both on the age and also on metal- 
licity of the stellar population. Although the dependence of 
7)4000 on the metallicity is weaker than the dependence on 
age, the metallicity is usually poorly known, and thus acts 
as a nuisance parameter in obtaining the differential age. To 
a good approximation the D4000 feature can be written as 

D4000 ex age^Z*^, (31) 

where Z is the metallicity of the "clock" and a, /3 are the ex- 
ponents of the corresponding power-laws. For spectra which 
have a poor signal to noise there will be a large systematic 
error on Z due to the fact that spectral lines will be harder 
to identify. This systematic can be modeled to be a signal- 
to-noise dependent shift ^'"^'"'"'-'^d = A(7)Z*''"'= where 7 
is the signal to noise ratio. Of course there might be other 
systematics affecting what we have called Z^^^'' , but this is 
beyond the scope of this paper. The biggest error is expected 
to be the systematic shift in the estimated metallicity A (7), 
which we model as a power-law A (7) — 67'', where we have 
parametrized our ignorance with O and h (Moresco et al 
2011 1. We can rewrite this as 



A(7) = QB 



log(7i) 



(32) 



where B = e . We then describe the "cosmic clocks" in the 
following way: 

d, = agefZ''(eB'°'^'^'')'^ for each i, (33) 



where di is the label corresponding to a given absorption 
line (the D4000 feature in this example), the index i runs 
over galaxies that have approximately the same metallicity, 
Z is some central value of the metalicity, 7^ is the signal 
to noise ratio of each measurement, and /? is some known 
power-thus if the original sample shows a wide range of 
metallicity it should be split in metallicity bins before it 
can be described by Eq. |33[ We now wish to find quantities 
which are independent of the nuisance parameters: Q and 
B. In this case the total derivative condition Eq. [2] becomes 



de 



= E 



ddi 



-Pi-0, 



dS 



JV 

= E 



- ^ -/31og(7i)— =0. 



ddi ' 



B 



which, if /fc is a power-law fk = Hi '^i\ '^^ simply 

N 



N 



E ^^ ^°g(^* 



0. 



(34) 



(35) 



(36) 



(37) 



We have done this explicitly for the example data in 
table [2] for 10 galaxies. This sample has characteristics not 
too dissimilar from those of the next H{z) release (Moresco 
et al. in prep.). The resulting combinations are the eight 
that satisfy 

69 = -(-306i-3062-30fe3-264-365-566-15&7-2368)(38) 
5 



2.00 


0.03 


0.15 


100 


1.95 


0.03 


0.2 


100 


1.90 


0.03 


0.25 


100 


1.81 


0.05 


0.4 


72 


1.82 


0.05 


0.5 


73 


1.76 


0.05 


0.6 


75 


1.83 


0.03 


0.7 


85 


1.70 


0.025 


0.9 


93 


1.60 


0.04 


1.05 


75 


1.50 


0.05 


1.28 


70 



Table 2. Measurements of the spectral feature around the 4000A 
break for different galaxies. These are examples of the types of 
data that one should obtain, and the results from actual meas- 
ruements are expected to be similarly distributed. We show the 
measured value of the D4000 feature in the first column, the er- 
ror on each measurement on the second column as given by its 
dispersion, the redshift z of the galaxy in the third column, and 
the signal to noise of the spectrum in the fourth column. 



61 62 63 ^4 bs t'6 b-j bs 



bg 



bio 



1 























-6 


5 





1 




















-6 


5 








1 

















-6 


5 











1 














-2/5 


-3/5 














1 











-3/5 


-2/5 

















1 








-1 























1 





-3 


2 























1 


-23/5 


18/5 



Table 3. E xp licit solutions of equations [34| and ^5] satisfying con- 



ditions 38 
in tablets 



as in table [21 i.e. i 
i = 2 to the second. 



and 39 for /j, = Yii d^' i using the measurements given 
k is the row index, and the column index i is ordered 
= 1 correspond to the first row in that table, 
and so on. 



610 — 5bi + 5^2 + 563 



3&4 
5 



265 



4-267 + 



(39) 



We explicitly give a set of 8 independent solutions in table 
[3] One may then use these combinations to produce robust 
measurements of the redshift as a function of cosmic time 
and apply it for example to the case of dark energy. Of course 
this cancelation of the systematics depends on the way we 
model them, but there is no method to treat completely 
unknown systematic errors. This method can be generalised 
to the more general case where the full spectrum is used 



(Jimenez et al.|p003 Simon, Verde, fc Jimenez||2005 Stern 



et al.|2010 l, although in this case the effect of the metallicity 
as a systematic will be smaller. 



3.5 Hubble diagram from SNla: Removing second 
parameter dependence 

The above example was motivated by the cosmic clocks 
problem, but to show that the solution found is general 
enough to be applied in different contexts, we consider a dif- 
ferent problem in this section. Supernovae type lA (SNla) 
are considered to be standard candles, so that, for a sample 
of supernovae spanning a wide redshift interval their appar- 
ent magnitude can be used to infer the luminosity distance 
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as a function of redshift and thus constrain cosmological pa- 
rameters. 

In reality, SNla are standardizable candles: an empir- 
ical algorithm ( |PhillipS[ 1993j relates peak luminosity and 
stretch of the light curve. Using this procedure one can ex- 
ploit SNla as if they were standard candles; the scatter 
around the luminosity distance relation gets dramatically 
reduced, and thus obtain a luminosity distance cIl using the 
distance modulus /i: 



fj, = m — M = 25 + 5 log d^ 



(40) 



where (11 encloses the dependence on cosmological param- 
eters, m denotes the SN apparent magnitude and M the 
absolute magnitude. The distance modulus is related to the 
peak supernova magnitude in r— band by 



mpcak = Mstand - Ce{s - I) + fJ.{z) + K(s, z) + Ar 



(41) 



where Mstand is the standardized peak B absolute magni- 
tude, a is the Phillips (1993) parameter relating stretch and 
peak luminosity, s is a typical stretch, ii{z) is the distance 
modulus, Ar is the r band Milky Way extinction along the 
supernova's line of sight and K{s,z) is an approximate K- 
correction to the r band at z = 0. Of course, one could 
use any optical band besides r. All these corrections have 
a residual scatter and there is evidence that some of the 
residual scatter could be due to some "second parameter" 
(e.g., [Aubourg et al.| (|2008|); [SuIII^an et al.| (|20Tot; [Brandt 



et al. 



(2010)) such as metallicity of the host galaxy. There 



are thus two sources of errors: a) the fact that, after all the 
corrections have been applied the average absolute magni- 
tude M, although common to all supernovae in the sample, 
is unknown and b) that there is a residual scatter around 
M which could, in principle, not be purely statistical but 
depend on extra parameters. 

Thus we can recast the above problem as 



M + AMi 



' fj-i 



(42) 



where i = 1,N runs over the SN in the sample, AMi in- 
clude all residual systematic errors, rhi the (k-corrected, 
extinction corrected, stretch-corrected) observable quanti- 
ties and /Xi enclose the cosmological dependence and is the 
quantity of interest. As these are logarithmic quantities, the 
reader will inmediately realize that this problem is similar 
to the cosmic clock one where log A = AMi, Z = log M and 
log a = Hi. One can completely cancel the dependence on 
M by taking linear combinations fk = X^i biirii that satisfy 



i:6?=o. 



(43) 



similarly to the BAO case. One thus obtains combinations 
that are robust under changes in the estimated value of M . 
We can do even better and attempt to cancel the re- 
maining systematics AMi. There are indications that the 
absolute magnitude depends also on the metallicity of the 
supernova (or host galaxy) and that this is the leading con- 
tribution to AMi, so we will assume that AMi depends on 
the host galaxy metallicity Z and model AM(Z) as a poly- 
nomiafj of some power i. 

^ The degree of the polynomial can be tuned such that it provides 
a good description to the actual data. 



AM(Z) = ei(2--Zo) + e2(^-^o)^ + ... + e,(Z-Zo) , (44) 

where Qi for i — !,...,£ and Zo are unknown parameters. 
This can also be thought of as a Taylor expansion of the 
dependence of AA'I on Z. This polynomial can be explicitly 
expanded and rewritten in the following form 



AM{Z) = 60-1- QiZ + QiZ^ + ... + QeZ'^ 



(45) 



where Qi are combinations of Oj and Zq. There is an ongo- 
ing effort to measure the metalicities, so that here we take 
them to be part of our space of observables. Therefore, our 
nuisance parameters will be M and Qi for i = 0,l,...,^. We 
find that the total derivative conditions can then be written 



d/fc d/fc .^-^ dfk 

^ OC =r- OC > — 

dOo dM ^ drrii 

^ 2 — 1 



dOi 

d/fc 
dOz 



JV 

E 



dfk ^^ 
drhi 

dfk y2 

ami 



dfk 
dQi 



= E 



dfk 
drhi 



Zl = 0. 



(46) 

(47) 
(48) 

(49) 



The solution to the set of differential equations is given by 
a linear combination fk ~ '^^^^rhi, where the coefficients 
h\ are the TV — ^ — 1 that satisfy the following conditions 



JV 

Yj^z. 

i = l 

JV 



0, 



Y^'^' 



(50) 
(51) 
(52) 

(53) 



We expect the results obtained here to be robust also under 
changes in the Qi, M , and Zq. 

Now let us compare the solution provided by this 
method with the standard approach. When analyzing a SN 
sample it is customary to marginalize over M as follows: 
for each choice of cosmological parameters (a) /^(^O is com- 
puted and used to extract M{ol); AMi then encloses the 
statistical error. One can see that this procedure is equiv- 
alent to marginalize over i/o- In fact one can rewrite the 
apparent magnitude as: 



= A/ - 5 log Ho -H 25 -H 5 log(J/orfL) 



(54) 



This procedure cannot be unbiased if the set of cos- 
mological models/parameters scanned does not include the 
true underlying model. The solution provided by the present 
method instead cancels out the dependence on M{a) provid- 
ing an unbiased answer, and can also mitigate the impact of 
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remaining systematics (such as the influence of the metalic- 
ity of on the absolute magnitude) . Of course this cancelation 
of the remaining systematics depends on the way we model 
them, but there is now way of treating completely unknown 
systematic uncertainties. 



3.6 CMB angular power spectrum. 

The angular power spectrum computed from the auto- 
correlation of a map produced by a given detector is given 
by CT""" = Cf"" + Cr""" where C?°"" denotes the power 
spectrum of the detector noise and is called "noise bias": 
a poor knowledge of the noise bias will bias the estimate 
of the CMB power spectrum. On the other hand the cross 
power spectrum for maps produced by different, uncorre- 
lated, detectors does not have noise bias. In this subsection 
we wish to reproduce this well-known fact in the light of our 
approach. This is a simple example of a problem which is 
neither linear nor a power-law, and can thus show how to 
apply our approach in more general cases. 

Let us model this case as having two detectors that 
give two different measurements of the temperature pertur- 
bations of the CMB with uncorrelated noise 



-,(2) 



= atr 
= air 

,(1,2) 






(55) 



where aj,^ are the temperature anisotropics decomposed 
in spherical harmonics measured by each detector, airn are 

(1 2) 

the "true" temperature anisotropics, and A'^^^ is the noise 
of each detector. We wish to apply the conditions given in 
Eq. PI to this case. Since there is a different contribution 
of the noise per each m, it would appear that the problem 
has no solution as there are as many nuisance parameters 
as observables. However not all of them are independent 
since they are uncorrelated among themselves and with the 
"true" temperature anisotropics, and must thus satisfy the 
following constraints 



(56) 
(57) 
(58) 



One way to approach the problem is to use all these con- 
straints in order to remain only with independent quantities. 
Though this gives the desired result, it is rather involved al- 
gebraically. We will instead use Lagrange multipliers and 
impose that the total derivative of the following function 





= 0, 


m 




E<'«^'"^ = 


= 0, 


Tn 




E^^mffl^'m = 


= 0. 



(1,2) 



with respect to each N j,^ 
independent 



be zero while keeping all of them 



,(59) 



/fc+A 


^JE< 




E<'«- 


. + A^^ 


y:^^ 


at 




m 




m 






m 




which 


I gives the following 


two 


equations 








+ Aiflfm 


+ A.^^ii' 


= 


0, 








dfk 

da?2 


1 xfe 

+ Aiaim 


+ ^'sN^/2 


= 


0. 









(60) 
(61) 



Imposing that the fk depend explicitly only on a\^ im- 
A3, and the differential equations can be 

= 0, (62) 

= 0, (63) 



plies Al = A2 = 
rewritten as 

^/fc , \fc„(2) 
dfk 



da 



(2) 



+ '^1 "fm 



with the well known solution 



fk = -A?^42a 



(2) 



(64) 



where any value of Ai is equally good since it is simply a 
global factor. We have thus recovered the fact that the cross 
correlations are independent of the noise. 



4 CONCLUSIONS 

We have presented an algorithm to minimise, or even com- 
pletely cancel out, the effect of systematic uncertainties (nui- 
sance parameters) that can somehow be modelled, even if 
the modelling is very rough or approximate. The method 
was inspired by renormalization group techniques, and this 
interpretation provides an elegant description of what nui- 
sance parameters are: we identify the physics that is invari- 
ant under arbitrary rescalings of the nuisance quantities. 
This nuisance-independent physics is completely character- 
ized by the nuisance scaling dimensions of the observables. 
In the limit case where nuisance can be totally washed out 
this recipe will unravel the underlaying responsible scale in- 
variance. 

Our general approach is given in Eqs. (1-2) and we re- 
port explicit recipes for cases where the observables have 
power law and linear dependences on nuisance parameters 
in Eqs. (11) and (12) respectively, which apply also if only 
an approximate cancellation of the nuisance parameter is 
possible. Additional constraints on nuisance parameters or 
observables can be included (Eq. 14). The algorithm is gen- 
eral and can be applied to any experiment or observation. 
However, because of our own field of expertise, we have cho- 
sen to illustrate the method with some examples drawn from 
astrophysics and cosmology. In doing so we have discovered 
some interesting new results that can be used to analyse 
observations from large scale galaxy and supernova surveys. 
For the case of baryonic acoustic oscillation experiments, 
we provide an optimal way to combine the observed quanti- 
ties to reduce systematic uncertainties in the sound horizon 
scale Vs. For supernova surveys we have provided a method 
to remove a dependence of the Hubble diagram of system- 
atic uncertainties due to metallicity variability of the SN (or, 
as a proxy, that of the host galaxy). Finally, we show how 
for the cosmic clock method the feared age-metallicity de- 
generacy can be completely removed by choosing adequate 
combinations of the observed quantities. We chose to focus 
on these techniques because they are the most promising 
to unveil the nature of dark energy; removing or minimiz- 
ing the dependence of this methods on poorly constrained 
systematics is crucial to be able to gain full advantage of 
the significant observational effort that is being invested in 
observational cosmology. As we have emphasized before our 



method is general and we hope it wiU be used in other areas 
of the experimental sciences. 



APPENDIX A: THE GENERAL PROBLEM 

In this appendix we wish to suggest an interpretation of 
Eq. [2] that might prove useful when solving more general 
problems. As in the main text, we consider A'^ observables 
Oi{fj,i, ..., fj-m ,v\, ■■■,1'n) that depend on m accurate or inter- 
esting quantities ^i and n "nuisance" or "biased" parame- 
ters Vi. Let us keep the quantities /ii fixed at some value, 
and assume that there are more observables than nuisance 
parameters N > n. The functions Oi{vi, ..., Vn) then define a 
mapping between R" and R'^, which under certain smooth- 
ness assumptions defines a manifold (an n-dimensional hy- 
persurface embedded in R^). If this manifold is orientable, 
there will be at least one vector field v] orthogonal to it, and 
in general there can be up to A'^ — n such vector fields. The 
integrals of these vector fields (which exist at least locally) 
will be constant on the hypersurface 



-dOi 



''^7^ = 2^ 



df dOi 
dOi dvj 



0, 



(Al) 



where fk is the integral of v^ . This is the same as equatiouM 
The problem is then to find such vector fields and integrate 
them. 
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